首页> 外文OA文献 >“Semantics Inside!” But let’s not tell the Data Miners: Intelligent Support for Data Mining
【2h】

“Semantics Inside!” But let’s not tell the Data Miners: Intelligent Support for Data Mining

机译:“内部语义!”但是,我们不要告诉数据挖掘者:数据挖掘的智能支持

代理获取
本网站仅为用户提供外文OA文献查询和代理获取服务,本网站没有原文。下单后我们将采用程序或人工为您竭诚获取高质量的原文,但由于OA文献来源多样且变更频繁,仍可能出现获取不到、文献不完整或与标题不符等情况,如果获取不到我们将提供退款服务。请知悉。
获取外文期刊封面目录资料

摘要

Knowledge Discovery in Databases (KDD) has evolved sig- nificantly over the past years and reached a mature stage offering plenty of operators to solve complex data analysis tasks. User support for build- ing data analysis workflows, however, has not progressed sufficiently: the large number of operators currently available in KDD systems and in- teractions between these operators complicates successful data analysis. To help Data Miners we enhanced one of the most used open source data mining tools—RapidMiner—with semantic technologies. Specifi- cally, we first annotated all elements involved in the Data Mining (DM) process—the data, the operators, models, data mining tasks, and KDD workflows—semantically using our eProPlan modelling tool that allows to describe operators and build a task/method decomposition grammar to specify the desired workflows embedded in an ontology. Second, we enhanced RapidMiner to employ these semantic annotations to actively support data analysts. Third, we built an Intelligent Discovery Assistant, eIda, that leverages the semantic annotation as well as HTN planning to automatically support KDD process generation. We found that the use of Semantic Web approaches and technologies in the KDD domain helped us to lower the barrier to data analysis. We also found that using a generic ontology editor overwhelmed KDD-centric users. We, therefore, provided them with problem-centric extensions to Protege. Last and most surprising, we found that our semantic modeling of the KDD domain served as a rapid prototyping approach for several hard-coded improvements of RapidMiner, namely correctness checking of workflows and quick-fixes, reinforcing the finding that even a little semantic modeling can go a long way in improving the understanding of a domain even for domain experts.
机译:过去几年中,数据库知识发现(KDD)有了显着发展,并达到了成熟的阶段,为大量的操作员提供了解决复杂数据分析任务的方法。但是,用户对构建数据分析工作流的支持还不够充分:KDD系统中当前有大量操作员,这些操作员之间的交互使成功的数据分析变得复杂。为了帮助数据挖掘者,我们使用语义技术增强了最常用的开源数据挖掘工具之一——RapidMiner。具体来说,我们首先使用eProPlan建模工具为数据挖掘(DM)过程涉及的所有元素(数据,操作员,模型,数据挖掘任务和KDD工作流程)添加注释,该工具可以描述操作员并构建任务/ method分解语​​法,以指定嵌入到本体中的所需工作流。其次,我们增强了RapidMiner以使用这些语义注释来积极支持数据分析人员。第三,我们构建了一个智能发现助手eIda,它利用语义注释和HTN计划自动支持KDD流程生成。我们发现,在KDD域中使用语义Web方法和技术有助于我们降低数据分析的障碍。我们还发现,使用通用本体编辑器使以KDD为中心的用户不知所措。因此,我们为他们提供了以问题为中心的Protege扩展。最后,也是最令人惊讶的是,我们发现KDD域的语义建模是RapidMiner几个硬编码改进的快速原型制作方法,即工作流和快速修复的正确性检查,从而强化了即使一点点语义建模也可以即使对于领域专家,也可以极大地增进对领域的理解。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
代理获取

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号